# Streaming Generation
Qwen2.5 Omni 7B GPTQ Int4
Other
Qwen2.5-Omni is an end-to-end multimodal model capable of perceiving various modalities such as text, images, audio, and video, and generating text and natural speech responses in a streaming manner.
Multimodal Fusion
Transformers English

Q
Qwen
389
8
Moondream2
Apache-2.0
Moondream is a lightweight vision-language model designed for efficient operation across all platforms.
Image-to-Text
M
vikhyatk
184.93k
1,120
Featured Recommended AI Models